Modelling Vague Content and Structure Querying in XML Retrieval with a Probabilistic Object-Relational Framework
Abstract
Many XML retrieval applications require relevance-oriented ranking of retrieved elements in order to capture the vagueness inherent in the information retrieval process. This ranking should support vagueness not only at the content level but also at the structural level. In this paper, we use a probabilistic object-relational framework to model representation and retrieval strategies that take into account vagueness at both the content and the structure level. Our approach combines established database technology with sound probability theory, thus allowing fast and flexible prototyping of various representation and retrieval strategies.
Similar resources
An Algebra for Probabilistic XML Retrieval
In this paper, we describe a new algebra for XML retrieval. We first describe how to transform an XPath-like query into our algebra. The latter contains a vague predicate, about, which defines a set of document parts within an XML document that fulfill a query expressed as in “flat” Information Retrieval – a query that contains only constraints on content but not on structure. This predicate is ...
An Attribute-based Model for Semantic Retrieval
This paper introduces a knowledge-oriented approach for modelling semantic search. The modelling approach represents both semantic and textual data in one unifying framework, referred to as the probabilistic object-relational content modelling framework. The framework facilitates the transformation of “term-only” retrieval models into “semantic-aware” retrieval models that consist of semantic p...
Models for Integrated Information Retrieval and Database Systems
In this paper, we show that there is a mismatch between information retrieval (IR) and database (DB) concepts, and we devise solutions for this problem. DB-oriented approaches have to distinguish between the logical and the content structure of objects, and should also consider the layout structure. Data independence, not regarded in IR before, can be achieved by using the notion of vague predica...
Vague Content and Structure (VCAS) Retrieval for Document-centric XML Collections
Querying document-centric XML collections with structure conditions improves retrieval precision. The structures of such XML collections, however, are often too complex for users to fully grasp. Thus, for queries on such collections, it is more appropriate to retrieve answers that approximately match the structure and content conditions in these queries, a process also known as vague co...
The MPEG-7 Multimedia Database System (MPEG-7 MMDB)
Widely used Database Management Systems (DBMS) offer multimedia extensions, such as Oracle’s interMedia. However, these extensions lack the means to meet the requirements of multimedia data in terms of semantic querying, advanced indexing, content modelling and multimedia programming libraries. In this context, this thesis presents a methodology for enhancing extensible Object-Relational Databa...